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Abstract — We study a class of discrete memoryless broadcast 
interference channels (DM-BICs), where one of the broadcast 
receivers is subject to the interference from a point-to-point 
transmission. A general achievable rate region TZ based on 
rate splitting, superposition coding and binning at the broad- 
cast transmitter and rate splitting at the interfering transmit- 
ter is derived. Under two partial order broadcast conditions 
interference-oblivious less noisy and interference-cognizant less 
noisy, a reduced form of TZ is shown to be equivalent to the region 
based on a simpler scheme that uses only superposition coding 
at the broadcast transmitter. Furthermore, the capacity regions 
of DM-BIC under the two partial order broadcast conditions 
are characterized respectively for the strong and very strong 
interference conditions. 

I. Introduction 

Broadcast channel and interference channel are two impor- 
tant classes of multi-user channels that have drawn consider- 
able research attention in the past few decades, mostly due 
to their simplicity as a fundamental building block and their 
close relevance to practical communication networks. While 
complete characterizations are not available, there have been 
significant advances on these topics in the information theory 
literature. Notably the best general achievable schemes for the 
two channels are respectively given by Marton [1] and Han- 
Kobayashi [2], which are capacity achieving for some subclass 
channels or under various conditions, such as the ones in fl3]- 
0. 

Motivated by an recent interest in a heterogeneous cellular 
network design paradigm [7 1, we explore a multi-user channel 
that combines the broadcasting and interference features, i.e. 
broadcast interference channel (BIC). Specifically we envision 
a communication scenario where a macro base station (BS) 
broadcasts to two macro users, one of which is interfered by a 
point-to-point transmission from a femto BS to a femto user. 
While the BIC studied presents a simplified version of what 
might happen in practice, we believe that a fundamental under- 
standing of this simpler channel is crucial for characterizing 
the trade-offs in heterogeneous networks. 

Variations of BIC have been previously studied by Shang 
and Poor in [8|, for a different interference profile where 
interference is from the broadcast transmitter to the point-to- 
point receiver, and in (9), for the Gaussian BIC where both 
of the broadcast receivers are subject to interference. Even 
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though the channel studied in this paper has a more restrictive 
interference profile than that in H, we address the more 
general discrete memoryless channel and provide more general 
classes of common strategies as well as capacity regions under 
some conditions. Specifically, we derive an achievable rate 
region TZ based on rate splitting, superposition coding and 
binning at the broadcast transmitter and rate splitting at the in- 
terfering transmitter. This region is a natural generalization of 
Marton's region [1] for a DM-BC. We then define two partial 
order broadcast conditions, interference-oblivious less noisy 
and interference-cognizant less noisy. Under these conditions, 
a reduced form of TZ is shown to be equivalent to the region 
based on a simpler scheme that uses only superposition coding 
at the broadcast transmitter. Furthermore, if interference is 
strong for the interference-oblivious less noisy DM-BIC, the 
capacity region is given by the aforementioned two equivalent 
rate regions. Interestingly, for the interference-cognizant less 
noisy DM-BIC, we argue that the strong but not very strong 
interference condition does not exist and in this case, we obtain 
the capacity region for the very strong interference. 

This paper is organized as the follows. The channel model 
is introduced in Section II, followed by the derivation of TZ 
in Section III. For DM-BIC with two partial order broadcast 
conditions, the equivalence of rate regions is presented in 
Section IV and the capacity regions are derived in Section 
V. This paper is concluded in Section VI. 

Notation: Let <f> denote a constant. The notation convention 
follows iflQl 

II. Channel Model 

A discrete memoryless broadcast interference channel is 
denoted by {X\ x X 2 ,p(yi, V2, ya\xi, x 2 ),yi x y 2 x y 3 ), 
where X. L , i = 1.2, are the input alphabets, 3^, j — 1,2,3, 
are the output alphabets and p(yi,y 2 ,y3\xi,x 2 ) is the chan- 
nel transition probability. In this paper, we concentrate on 
a specific interference profile, where p(yi, y2, 2/3 1 ^ x 5 x 2 ) = 
p{y\\x\)p{y 2 \x\, x 2 )p{yz\x2). As shown inFigQ] while trans- 
mitter 1 wishes to broadcast to receivers 1, 2 , the second 
receiver is interfered by transmitter 2 who wishes to commu- 
nicate with receiver 3. 

Definition 1: A (Mi, M 2 , M3, n) code consists of message 
sets Wj — {1, Mj}; two encoding functions X\ : (Wi x 



W 2 ) -»• #[\ X 2 



W3 — > X2 and three decoding functions 
= 1,2.3. 
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Fig. 1: Channel Model 



The messages Wj are uniformly from Wj. The average error 
probability for the (Mi, M 2) Ms, n) code is 

P e = Pr(. 9 i(YT) ?4 or 52 (F 2 ") ? W 2 or .g 3 (r 3 ") ^ W 3 ). 

Definition 2: Rates of a (Mi, M 2 , M 3 , n.) code are defined 



as R 



_ log 2 (Mj ) 



for j = 1,2, 3. 



Rates (i?i, i?2, i? 3 ) are said to be achievable if there exists a 
sequence of (Mi, M 2 , M 3 , n) codes with P e — > as n — > 00. 
An achievable rate region is the set of all achievable rates for 
a given coding scheme. The capacity region is the closure of 
the union of all achievable rate regions. 

III. Achievable Rate Region for a General DM-BIC 

In this section, we derive an achievable rate region for a 
general DM-BIC, where the broadcast transmitter employs rate 
splitting, superposition coding and binning and the interfering 
transmitter employs rate splitting. 

Theorem 1: TZ is an achievable rate region for DM-BIC, 
where TZ is the closure of all (Ri, i? 2 , R3) satisfying 

Ri<I{Vi;Yi\Q) 
R 2 <I(V 2 ;Y 2 \U 2 ,Q) 
R 3 <I(X 2 ;Y 3 \Q) 
Ri + R2 < I{ Vx ; Yi I (7i , Q) + I( V 2 ; Y 2 \ U 2 , Q) - 

/(Vi;F 2 |t/i,Q) 
Rx+R 2 < I(yi;Yi\Q) + I(ya;Y2\Ui,Ua,Q)- 

I(Vi;V 2 \Ui,Q) 
R 2 +Rs< I(V 2 ,U 2 ;Y 2 \Q) + I(X 2 ;Y 3 \U 2 ,Q) (D 
Ri +R 2 +R S < I(V 1 ;Yi\U 1 ,Q)+I(V 2 ,U 2 ;Y 2 \Q)+ 

I{Xr,Y a \Ua,Q)-I(Vi;V a \Ui,Q) (2) 
Ri +R 2 +R S < I(V 1 ;Yi\Q) + I(V 2 ,U 2 ;Y 2 \Ui,Q)+ 

I(X 2 ;Y 3 \U 2 ,Q) - I(Vi;V 2 \Ux,Q) O) 
Ri,R 2 ,Rs > 

for some function X\ = f(Ui, V%, V2) and joint distribution 

'Pq,u 1 ,Vi,V2,U2,x 2 

= p{q)p{ui\q)p(v 1 \q,u 1 )p(v 2 \q,u 1 )p(u 2 \q)p(x 2 \u 2y q), 

such that 

I(V 1 ;Y 1 \U 1 ,Q) + I{V 2 ;Y 2 \U 1 ,U 2 ,Q)-I{Vi;V 2 \U 1 ,Q)>0. 

(4) 

Proof: The proof is relegated to App. [A] Here we provide 
a sketch. The messages for receivers 1, 2 are split into common 
and private parts respectively. Common messages are carried 
by the cloud signal Ui, which is decoded at both Y\ and Y 2 . 
The private message carriers Vi, V2, which are only decoded 
at their respective intended receivers and treated as noise 



elsewhere, are superimposed upon U\, where binning is used 
to allow arbitrary dependence between V\ and V 2 - At the 
interfering transmitter, rate splitting is employed to alleviate 
interference by receiver 2 decoding the common signal U2 
while treating the private as noise. ■ 

Remark 1: Constraint on the choice of joint input 
distributions is a direct consequence of the nonnegativity of 
some intermediate rates, which are eventually eliminated using 
Fourier-Motzkin procedure. 

Remark 2: With U x = V x = <f>, X x = V 2 and R x = 0, TZ 
reduces to the compact Han-Kobayashi region [12] for a one- 
sided interference channel. With X 2 = U 2 = <f> and R 3 = 0, 
TZ reduces to the most general form of Marton's region with 
private message sets for a general DM-BC ifTTl . Notice that 
with U 2 = <f>, the constraint reduces to 

J(Vi ;Y 1 \U 1 ,Q) + I(V 2 ;Y2\U 1 ,Q)- J(Vi ;V 2 \U 1 ,Q) > 0, 

(5) 

which applies to Marton's region. However a closer ex- 
amination reveals that (0 is unnecessary. For some joint 
distribution that violates Q, we have Ri,R 2 < where 
/ = min{/(C/i; Yi),/(C/i;y 2 )}- Clearly (R U R 2 ) is contained 
in Marton's region for some other joint distribution that 
satisfies (0). Therefore removing constraint (0 does not really 
enlarge the region. 

IV. Equivalence of Rate Regions for the DM-BIC 
under Partial Order Broadcast Conditions 

Here we concentrate on DM-BIC under two partial order 
broadcast conditions: interference-oblivious less noisy and 
interference-cognizant less noisy, which will be defined next. 

Definition 3: In a DM-BIC, receiver 2 is said to be 
interference -oblivious less noisy than receiver 1, denoted by 
Yi < Y 2 , if /(C/i;Yi) < I(U i; Y 2 ) for all p(u 1 ,x 1 )p(x 2 ) 
such that Ui -> (X U X 2 ) -)■ (Yi,Y" 2 ). 

Definition 4: In a DM-BIC, receiver 1 is said to be 
interference-cognizant less noisy than receiver 2, denoted by 
Yi y c Y 2 , if 7(£/i;Yi) > I{Uf, Y 2 \X 2 ) for all p(u llXl ) P {x 2 ) 
such that Ui (X U X 2 ) {Y U Y 2 ). 

Remark 3: We can interpret Yi Y 2 as the follows: 
receiver 2 is less noisy than receiver 1 (4), even though no 
particular action is taken by receiver 2 to deal with interfer- 
ence. Similarly, Yi >- c Y 2 can be interpreted as the follows: 
even if interference X 2 is provided to receiver 2, receiver 
1 is still less noisy. Also note that degradedness (physical 
or stochastic, which are the same in broadcast channel [10. 
Theorem 15.6.1]) implies the partial order conditions and 
hence is stricter. For example, Y 2 being degraded with respect 
to Yi, i.e. X\ — > Y\ — » Y 2 holds for all p(xi,x 2 ), implies 
Yi ^ c Y 2 , but not vice versa. 

The first class of schemes we consider is a specialization 
of TZ, given in the following two Corollaries. 

Corollary 1: TZ\ is an achievable rate region for DM-BIC 
with Yi -< Y 2 , where TZi is the closure of all (Ri, R2, R3) 



satisfying 



satisfying 



Ri <I(Ui;Yi) 
R 3 <I(X 2 ;Y 3 ) 
Ri + R 2 < I(Ui; Yi) + /(Xi; Y 2 \Ui,U 2 ) 
Ri + R 2 + Ra < I(Uv, Yi) + I(X U U 2 ; Y 2 \Ui)+ 
I(X 2 ;Y 3 \U 2 ) 
Ri,R 2 ,R 3 > 



(6) 
(7) 



for some V Uu x u u 2 ,x 2 = p{ui)p{xi\u 1 )p{u 2 )p{x 2 \u 2 ). 

Proof: Fix Q = cj>. Specializing 1Z with X% = V 2 , V\ = 
U\ and removing redundant inequalities due to Y\ -< G Y 2 , we 
obtain IZi. ■ 
Corollary 2: 1Z 2 is an achievable rate region for DM-BIC 
with Y\ )~ c Y 2 , where 1Z 2 is the closure of all (R~i,R 2 ,R 3 ) 
satisfying 



R 2 <I(Ur,Y 2 \U 2 ) 
Rz<I{X 2 ;Y z ) 
Ri+R 2 < I(Xx;Yi\Ux) 
R 2 + R 3 <I(Ui,U 2 ;Y 2 ) 
R 1 +R 2 +R a <I(X 1 ;Y 1 \U 1 ) 
Ri,R 2 ,R 3 > 



■I(Ui;Ya\U 2 ) 
■ I(X 2 ;Y 3 \U 2 ) 
I(U U U 2 ;Y 2 ) 



(8) 

(9) 
(10) 

I(X 2 ;Y 3 \U 2 ) 



for some Vu x ,x u Vi,x?. = p(ui)p(xi\ui)p(u 2 )p(x 2 \u2). 

Proof: A direct specialization of 7Z will result in some 
extra inequalities that are harder to remove. Hence we take 
an indirect approach, where the specialization is done for an 
equivalent region of 7Z. The details are provided in App. [B] 

■ 

Notice that to derive IZi, i — 1,2, we fix the time- 
sharing r.v. Q. In principle, we could have kept Q intact when 
specializing 1Z, but the following proposition asserts that there 
is no benefit doing so. Since time-sharing always results in a 
region no smaller than convex hull operation, it follows that 
taking convex hull is also unnecessary. 

Proposition 1: Time-sharing does not enlarge IZi, i = 1,2. 
Proof: The proof is relegated to App. [C] ■ 

Next we present two achievable rate regions, IZny i = 1, 2, 
which are solely based on superposition coding (i.e. no rate 
splitting at the broadcast transmitter), where the cloud center 
carries only receiver i's message. Since the proofs are stan- 
dard, they are omitted for conciseness. 

Theorem 2: TZ-m is an achievable rate region for DM-BIC 
with Y\ < Q Y 2 , where TZm is the closure of all (Ri, R 2 , R3) 
satisfying 

Ri < I(Ui;Y x ) 
R 2 < I{Xi;Ya\Ui,Ua) 
R 3 <I(X 2 ;Y 3 ) 
R 2 +Rz< I(X!,U 2 ;Y 2 \Ui) + I(X 2 ;Y S \U 2 ) 
Ri , R 2 , R3 > 

for some Pui,Xi,u 2 ,x a = p{ui)p{xi\ui)p{u 2 )p(x 2 \u 2 ). 

Theorem 3: TZ( 2 ) is an achievable rate region for DM-BIC 
with Y\ y c Y 2 , where TZ/2) is the closure of all (Ri, R 2 , R3) 



Ri </(Xi;Yi|{/i) 
R2 < I{Uv,Y 2 \U 2 ) 
R 3 <I(X 2 ;Y 3 ) 
R 2 +R 3 <I(U 1 ,U 2 ;Y 2 ) + I(X 2 ;Y 3 \U 2 ) 
Ri,R 2 ,R 3 > 

for some Vu u x u u^x 2 = p(ui)p(xi\u 1 )p(u 2 )p(x2\u2). 

In deriving the most general region 1Z, we used rate 
splitting, superposition coding and binning at the broadcast 
transmitter. Regions IZi, i = 1,2, are derived from 1Z when 
binning is stripped off but rate splitting and superposition 
kept intact. While both IZi and TZ^ rely on superposition 
coding, there is a subtle difference. Despite the fact that both 
schemes' cloud centers carry receiver i's message, the one 
for IZi could also carry receiver j's (j ^ i, j = 1, 2) common 
message, which could be potentially helpful to reduce the self- 
interference due to the fact that part of the broadcast signal 
intended for receiver j is essentially interference from receiver 
i's perspective. It is apparent that the superposition-only-based 
rate regions are not larger than the ones based on superposition 
and rate splitting, since the latter includes the former as a 
special case. This can be also verified by explicitly checking 
that the inequalities defining TZuj induce those in IZi, but not 
vice versa. Hence at first sight it seems that IZi is strictly larger 
than lZ(i) ■ However, if we consider the no interference case, 
i.e. U 2 — X 2 — <p, i?3 = 0, IZi cannot be strictly larger than 
IZu) since the latter is the capacity region of a less noisy (or 
degraded) DM-BC 1111 . The pitfall of the previous argument is 
that it only considers a specific input distribution. It is true that 
for some given T , Ui,x 1 ,U2,x 2 i is strictly larger, however 
once we consider all Vu 1 ,x 1 .u 2 ,x 2 > we w iU show that they 
are indeed equivalent. 

Theorem 4: IZi = TZ(i), i — 1,2. 

A. Proof of Theorem [4] 

Before proving Theorem @] we need the following defini- 
tions and lemmas. 

Definition 5: Let R™ be a convex subset of R™, a n- 
dimensional Euclidean space. A point X e M™ is an extreme 
point (ExP) iff whenever X = tY + (1 - t)Z, t G (0, 1) and 
Y ^ Z, this implies either Y W> or Z g R™ 

Definition 6: An ExP X € M" is said to be dominant 
(DExP) iff there does not exist another ExP Y G K™, Y ^ X, 
such that X < Y element-wise. 

Remark 4: In the literature, the term "dominant extreme 
points" are sometimes referred as corner points. The intention 
of choosing the former terminology is to emphasize the 
connection to convex set. 

Let TZ n be a n-dimensional convex rate region, of which the 
set of all DExPs is denoted by Q. Further let co(tt) denote 
the convex hull of ft: 



C0(Q) = < ^2 a iR-i 



Ri en,a t e [0,1],^ a* = 1,771 = 1,2, 



Lemma 1: R 6 lZ n iff there exists some R' G co(f2) such 
that R < R' element-wise. 

Proof: For the "if" part, since DExPs are achievable, so 
is their convex combination, specifically R' is achievable. If 
R < R', R is also achievable. For the "only if" part, for a 
convex region 7Z n , any achievable rate can be expressed as a 
convex combination of some ExPs, i.e. there exists some Ri G 
fy, on S [0, 1] and an integer m such that R = Yli=i 
where "J denotes the set of all ExPs for lZ n . Now replacing 
any non-dominant R^ that constitutes R by its corresponding 
DExP and keeping convex coefficients intact, we obtain 
R' e co(fi), where R < R. ■ 

LemmaQ]suggests that a rate region is completely described 
by its DExPs. When comparing different rate regions, it 
suffices to consider their sets of DExPs, which will be given 
in the follows for IZi and IZ^, i = 1, 2, respectively. 

Lemma 2: For a Vu 1 ,x 1 ,u 2 ,x 2 , DExPs of 1Z 2 are given by: 

A= (l(Xv,Yx\Ux), I(Uy,Y 2 \U 2 ), min{I(X 2 ;Y 3 ), I{U 2 ;Y 2 )+ 
I{X 2 ;Y 3 \U 2 )}) 

B= (/(Xi;Yi|t/i), min{/(l/ i; Y 2 |l/ 2 ), [I(U U U 2 ; Y 2 ) - I(U 2 ; Y 3 )]+}, 

I(X 2 ;Y 3 \U 2 ) +min{I{U 2 ;Y 3 ), I(Ui, U 2 ; Y 2 )}) 
C= ( y I(X 1 ;Y 1 \Ui) + mm{I(U 1 ;Y 2 \U 2 ), [I(Ui, U 2 ; Y 2 ) - I(U 2 ; Y 3 )}+}, 

0, I(X 2 ;Y 3 \U 2 ) + mm{I(U 2 ;Y 3 ), I(Ui, U 2 ; Y 2 )}) 
D=(l{X 1 ;Y 1 \U 1 )+I{Ui;Y 2 \U 2 ), 0, min{/(X 2 ; Y 3 ), I(U 2 ;Y 2 )+ 

I(X 2 ;Y 3 \U 2 )}) 

Proof: The proof is relegated to App. [D] ■ 

Lemma 3: For a Vu 1 ,x 1 ,u 2 ,x 2 , ^(2) has two DExPs A, B 
as in Lemma [2] 

Proof: The proof is relegated to App. [E] ■ 
Lemma 4: For a P^Xi,^,^' DExPs of 1Z\ include 

E= (/(J7 i; Yi), /(Xi;Y 2 |C/i,C/ 2 ), min{/(X 2 ;Y 3 ), 7(C/ 2 ; Y 2 |l/l)+ 

/(X 2 ;Y 3 |C/ 2 )}) 
F= (/(C/i;Yi), min{/(X i; y 2 |l/i,(7 2 ), [/(Xj, Ua; Y 2 |C/i)- 

/(C/ 2 ;y 3 )] + }, /(X 2 ;Y 3 |[/ 2 ) + min{/(£/ 2; y 3 ), 

7(Xi,C/ 2; y 2 |(7i)}) 

G=(o, /(C/i;Yi)+/(Xi;Y 2 |[/i,C/ 2 ), min{/(X 2 ; Y 3 ), /(C/ 2 ; Y 2 |£/i) 

+ /(X 2 ;Y 3 |C/ 2 )}). 

Furthermore, if Vu x ,x u Ui,x 2 satisfies 7([/ 2 ;Y 3 ) < 
7(t/i; Yi) + 7(Xi, U 2 ; Y 2 \Ui), there are two more DExPs 

H=(o, /(C/i;Yi) + /(Xi;y 2 |C/i,C/ 2 ) + min{0, Z(J7 2 ; Y 2 \Ui) 

-I(U 2 ;Y 3 )}, /(X 2 ;Y 3 )) 
/= (j(t/i;Yi) + min{0, /(X,, C/ 2 ; Y 2 |C/i) - /(C/ 2 ; Y 3 )}}, 

[/(Xi;y 2 |(7i,C/ 2 ) + min{0, 7(E/ 2 ; Y 2 |C/i) - I(U 2 ; Y 3 )}] + , 
/(X 2 ;Y 3 )}). 

Otherwise if I(U 2 ; Y 3 ) > I{Ux\Y x ) + I(X U U 2 ; Y 2 \Ui), there 
is one more DExP 

J= (0, 0, /(£/ 1 ;Y 1 ) + J(X 1 ,C/ 2 ;Y 2 |J/ 1 ) + I(X 2 ;Y 3 ia 2 )). 



Proof: The proof is relegated to App. [F] ■ 

Lemma 5: For a Vjji,Xi,u a ,X2t has two DExPs F, 7 1 
as in Lemma |4] 

Proof: The proof is exactly the same as Case 1 for IZi. 

■ 

Proof of Theorem® We will first prove 1Z 2 = TZ( 2 ) and 
then = TZ (1 y Let P = V Ul ,x 1 ,u 2 ,x 2 - We use "P(y 1= to 
denote the same distribution except that Ui = <fi. 

1. Proof of n 2 = n {2) 

From Lemma [2] and [3] for a given "P, 1Z 2 has two more 
DExPs than H( 2 ). However, for Vu 1 =U 2 =(l>' A becomes A' = 
( I{X 1 ;Y 1 ), 0, I(X 2 ;Y 3 ) ) and it can be shown C, D < A' 
due to Y\ >~ c Y 2 . Therefore if we take the union of regions 
for the two distributions V and Vu x =u 2 =<f>< both 1Z 2 and TZ-{ 2 ) 
will have identical DExPs. By Lemma [T] 1Z 2 = 7^(2) ■ 

2. Proof of K 1 =K {1) 

This part is more involved, but the idea is essentially the 
same. We first show G is redundant. Given any V, consider 
another joint distribution Vxj\-4>- Then E and F become 

E'= (0, I(XyY 2 \U 2 ), min{/(X 2; y 3 ), I(U 2 ; Y 2 ) + I(X 2 ; Y 3 |«7 2 )}) 
F'=(o, min{J(Xi;Y 2 |!7 2 ), [J(Xi, C/ 2 ; Y 2 ) - /(C/ 2 ; Y 3 )] + }, 
I(X 2 ;Y 3 \U 2 ) +min{/([/ 2 ; Y 3 ), J(Xi, !7 2 ; Y a )}) . 

The region, specified by DExPs _E' and F', can be alternatively 
described by the following inequalities (this can be verified by 
setting U\ = <j> in 72-i) 

i?s </(^2;Y 3 ) 
H 2 < I(Xi;Y 2 \U 2 ) 
R 2 +R 3 <I(Xi,U 2 ;Y 2 ) + I(X 2 ;Y 3 \U 2 ) 
R 2 ,R 3 > 0, iii =0 

Using the fact I(Uy,Yi) < I{Ui;Y 2 ) due to Yi ~< Y 2 , it can 
be checked that G is contained in the above region. Hence G 
is redundant. 

Next we will show that H, I either reduce to other DExPs 
or are redundant if we consider all input distributions. Let 
us first focus on H. If I(U 2 ;Y 3 ) < I{U 2 :Y 2 \U 1 ), G = 
(0, I(U 1 ;Y 1 )+I{X 1 ;Y 2 \U ll U2), I(X 2 ;Y 3 )) and H — G. If 
S : I(U 2 ; y 2 |I7i) < /(C/ 2 ; Y 3 ) < I^U^Y^), we have 

F= (/(f/i;Yi), I(Xi,U 2 ;Y 2 \Ui) - I(U 2 ;Y 3 ), I(Xr,Y 3 j) 
H= (0, 7([/i;Yi)+7(Xi,[7 2 ;Y 2 |[/i)-/([/ 2 ;Y 3 ), /(X 2 ;Y 3 )) 

Notice that for any V, if condition 5 holds, then for Pu 1= <p, 
S still holds. Hence if we let U\ = <j>, F becomes F' = 

(0, I{X U U 2 ;Y 2 ) -I(U 2 ;Y 3 ), I{X 2 ;Y 3 )) and H < F> due 
to Y 1 < Y 2 . If 7(X 1) [/ 2 ;Y 2 |C/ 1 ) < 7([/ 2 ; Y 3 ) < 7(C/ 1 ;Y 1 ) + 
7(Xi, Z725 ^2|f7i), by setting U 2 = 4> w& have 

G'= (0, 7(f/i;Yi)+7(Xi;Y 2 |{7i), 7(X 2 ;Y 3 )) > H. 

We now consider 7. If I(U 2 ;Y 3 ) < I{U 2 ;Y 2 \Ui), I = 
E. If 7(£/ 2 ;Y 2 |£/ 1 ) < 7(t/ 2 ;Y 3 ) < 7(X X , [/ 2 ; Y 2 |[/ 1 ), 
7 = F. If 7(X 1 ,[/ 2 ;Y 2 |f/ 1 ) < 7(t/ 2 ;r 3 ) < 7(C/ 1 ;Y 1 ) + 
I(X ll U 2) Y 2 \U\), I reduces to 

/= (7({7 1 ;Y 1 ) + 7(Zi,«7 2 ;Y 2 |?7 1 )-/([/ 2 ;Y 3 ) ) 0, 7(Z 2 ;Y 3 )) 



If we further let U\ = X\ and U 2 = 0, E becomes E' = 
{liXx-Yx), 0, I(X 2 ;Y 3 )). Clearly, I < E' . 

At last, we consider J. For any V, setting U\ = U2 — 4>, 
E becomes 

E'= (0, I{Xi;Y 2 ), I(Xr,Y 3 j). 

Clearly, J < E' due to the condition I(Ui;Yi) + 
I(Xi, U2] Y2\U\) < I(U2\ Y3) and hence J is redundant. 

To summarize, even though for a specific V, 1Z\ could have 
more DExPs than R(\), if we consider all possible V, they will 
have exactly the same set of DExPs given by E and F. By 
Lemma Q] 1Z\ = R(i)- ■ 

V. Capacity Regions under the Strong/Very 
Strong Interference Condition 

In this section, capacity regions of DM-BIC with Y 1 -< Y 2 
and Y\ >~ c Y2 are established respectively for the strong and 
very strong interference conditions defined in the following. 

Definition 7: Interference is said to be strong if for all 
p{x x )p{x 2 ), I(X 2 ;Y 2 \X 1 ) > I(X 2 ;Y 3 ). 

Definition 8: Interference is said to be very strong if for all 
p(x 1 ) P (x 2 ), I(X 2 ;Y 2 ) > I(X 2 ; Y 3 ). 

Remark 5: The intuition behind these definitions, which are 
the same as the regular interference channel ifPTl . is that by 
conditioning on the intended signal, whose decoding is assured 
to be successful by design, the interfered receiver sees a better 
channel than interference's own receiver. This suggests that the 
interfered receiver should be able to decode the interference 
along with its intended signal, by performing a joint decoding 
if interference is strong. If further interference is very strong, 
successive interference cancellation decoding suffices, where 
interference is decoded first. Evidently very strong condition 
is stricter than the strong condition. 

Theorem 5: The capacity region of DM-BIC with Y\ -< 
Y2 and the strong interference condition is the closure of all 
{R\,R 2 ,Rz) satisfying 

Ri < I(Ui;Y!) 
R 2 < I(X l ;Y 2 \U 1 ,X 2 ) 
Rs < I(X 2 ;Y 3 ) 
R2 + R 3 < I(Xi,Xr,Y a \Ui) 
Ri,R 2 ,R 3 > 

for some V Ul: x 1 ,x 2 = p(ui)p(xi \ui)p(x 2 ). 

Proof: The proof is relegated to App. [G] ■ 

Remark 6: The capacity region takes two different forms. 
The one given in Theorem|5]is identical to TZ-m with U2 = X 2 . 
An alternative form is given by Tvli with U2 = X2. 

When receiver 2 is interference-oblivious less noisy than 
receiver 1, for any sensible coding scheme X\ should always 
be decodable at receiver 2 (otherwise, none of the broadcast 
receivers can do so). Hence the strong condition, originated 
from interference channel, naturally carries over to DM-BIC 
with Y\ -< Y 2 . However, this is not the case for DM-BIC 
with Y\ >- c Y2, which will be discussed next. 



Theorem 6: The capacity region of DM-BIC with Y\ >- c Y 2 
and the very strong interference condition is the closure of all 
(Ri,R2,Rs) satisfying 

Ri < I(Xv,Yi\Ui) 
R2 < I{Ui;Y a \X 2 ) 
R 3 < I(X 2 ;Y 3 ) 
Ri,R 2 ,R 3 > 

for some V UliXu x 2 ^ p(ui)p{x 1 \u 1 )p(x 2 )- 

Proof: The achievability follows those for Hn) and H2, 
all with with U 2 = X 2 - The converse proof is standard. ■ 

Remark 7: Similarly to Theorem [5] the capacity region 
takes two forms, one given in Theorem [6] which is essentially 
R(2) with U2 — X2, and another given by IZ2 with U% — A2. 

It is not difficult to see that strong condition in Definition 
|7] does not fit well for DM-BIC with Y x )~ c Y 2 . The reason is 
that if X\ is the intended signal for receiver 2, i.e. X\ always 
decodable at receiver 2, then by Y\ >- c Y2, receiver 1 can 
decode it as well. Hence the two receivers will always decode 
the same set of messages, which clearly does not represent the 
most general case. In fact, we claim that the strong but not 
very strong interference condition does not exist for DM-BIC 
with Y\ >~ c Y 2 . The argument is as the follows. 

The problem is to figure out what is the intended signal for 
receiver 2. Once we find out such a signal, we can mimic 
the strong condition in Definition [7] with modification of 
conditioning on that signal instead of X\. Suppose there exists 
some strong condition, then interference X2 is required to be 
decoded at receiver 2. Under this restriction, we have an upper 
bound n(R 2 + R 3 - e„) < I(W 2 , W 3 ; Y 2 n ). Along with other 
straightforward upper bounds, by the same technique that we 
used above to prove Theorem [5] we can show that R(2) with 
U2 = X2 is the capacity region. This implies that if there exists 
some strong condition, then superposition coding with cloud 
center U\ carrying receiver 2's message is capacity achieving. 
Hence without loss of generality, we can view the cloud center 
U\ as the intended signal for receiver 2, which in return gives 
us the strong condition /(X2; i^l^i) > -^(-^25^3), for all 
p(u 1 )p(x 1 \ui)p(x 2 ) such that Ui (X 1 ,X 2 ) -> (^2,^3) 
form a Markov chain. However, this condition always implies 
the very strong condition (consider U± = (f>) and furthermore 
the strong interference capacity region, R(2) with U2 = X2, 
always reduces to the very strong capacity region given in 
Theorem [6] In other words for Y\ >- c Y 2 , if interference is 
strong, then it has to be very strong. 

VI. Conclusion 

In this paper, we devise a coding scheme combining rate 
splitting, superposition coding and binning for a general DM- 
BIC. The obtained achievable rate region is then specialized 
to DM-BIC under two partial order broadcast conditions: 
interference-oblivious less noisy and interference-cognizant 
less noisy. By carefully inspecting the dominant extreme 
points, the specialized rate region is shown to be equivalent to 
that based on a simpler scheme that uses only superposition 



coding at the broadcast transmitter. For the interference- 
oblivious less noisy DM-BIC, if interference is strong, the 
capacity region is given by the aforementioned two equivalent 
rate regions. For the interference-cognizant less noisy DM- 
BIC, we argue that the strong but not very strong interference 
condition does not exist and in this case, we obtain the capacity 
region for very strong interference. 

Appendix A 
Proof of TheoremQ] 

We will first obtain an achievable rate region TZ in Lemma 
[6] Then we prove TZ = TZ. 

Lemma 6: TZ is an achievable rate region for DM-BIC, 
where TZ is the closure of all (Rx,R 2 ,R3) satisfying all 
inequalities defining 1Z plus two more constraints 

Ri < I(V 2 ,U 2 ;Y 2 \Ux,Q) + I(X 2 ;Y 3 \U 2 ,Q) (11) 
Rs < I{Vr, Yl \Ui, Q) + I(V 2 , U 2 ; Y 2 1 Ux, Q) + I(X 2 ; Y 3 \U 2 , Q) - 
I(Vi;V 2 \Ui,Q). (12) 

Proof: 
Codebook generation: 

Split Yi's message into two parts: mi and i. Sim- 
ilarly for Y-2, m 2 and j. Generate 2 n ( Rlc+R2c ) inde- 
pendent codewords u™ (mi, 7712) with each symbol i.i.d 
according to PUi(')> m i G {1, 2, .., 2 nRlc }, m 2 G 
{l,2,...,2 nR2 °}. For each u^(mi,m 2 ) generate 2 n( - R ^ +R '^ 
conditionally independent codewords u"(mi, m^, i, i') with 
each symbol i.i.d according to p Vl \ Ul (-\ui(mi,m 2 )), i G 
{l,2,...,2 nRl "}, i' e {l,2,...,2 nR '4- Similarly for each 
u^{mi,m%), generate 2 n ( R2p+R2 ' ) conditionally independent 
codewords u?? (mi, m2,i, j') with each symbol i.i.d accord- 
ing to PV2lUl (-\u 1 (m 1 ,m 2 )), j G {l,2,...,2 ni M> f £ 
{1,2,.., 2"^}. 

Split y 3 's message into two parts: k and /. Generate 2 nT3 
independent codewords v%(k) with each symbol i.i.d accord- 
ing to Pu 2 (')< k S {1, 2, 2 nT3 }. For each u 2 (k) generate 
2 nS3 conditionally independent codewords x^(k, I) with each 
symbol i.i.d according to Px 2 \u 2 (-\u 2 (k)), I £ {1, 2, 2" Ss }. 

Encoding: 

Given message quadruple (mi,i,m,2,j), broadcast trans- 
mitter tries to find a pair ) such that 

(vi(mi,m 2 ,i,i'),v 2 (mi,m 2 , £ A { "\Vx, V 2 ). 

If there is one or more such pairs, choose one and send 
x™ = f n (ui(m 1 ,m2),vl l (mi,m2,i,i'),v 2 l (m 1 ,m 2 ,j,j')), 
where /(•) is a deterministic function. If there is no such 
pair, an error is declared and a predefined codeword is sent. 
Interference transmitter sends codeword x^ik, I) for message 
pair (k, I). 

Without loss of generality, in the following we assume 
(mi, i, m 2 ,j, k, I) = (1, 1, 1, 1, 1, 1) is sent. 
Decoding: 

Receiver Y\ looks for (mi, rh 2 , such that 

(ui(Ai,m 2 ),vi(Thi,m 2 ,i,i'),yi) G Ai n \lfi, Vi, Yi). 

If there is no such quadruple or some such quadruple with 
either mi ^ 1 or i ^ 1 or both, an error is declared. 



Receiver Y 2 looks for (rhi,m 2 , j , f , k) such that 

(ui(7h 1 ,m 2 ),V2(m 1 ,rh 2 ,j,j'),U2(k),y2) G Ai n) (Ui, V 2 , U 2 , Y 2 ). 

If there is no such tuple or some such tuple with either rh 2 ^ 1 
or j 7^ 1 or both, an error is declared. 

Receiver Y3 looks for unique (k, l) such that 

(■uS(k),xS(k,i),yS) G A?\U 2 ,X 2 ,Y 2 ). 

If there is none or more than one such pair, an error is declared. 

Analysis of error probability: 

At broadcast encoder: Given (mi, m2, i, j), with high 
probability there is at least one pair such that 

(i)"(mi, m2, i, i'), U2 (mi, m 2 ,j, j')) is jointly typical if R[ + 
R' 2 > I(V\_; V 2 \Ui) due to mutual covering lemma ifTTI . 

At receiver Y\. Using standard techniques from [11], where 
all error events are first determined using a joint pmf fac- 
torization table and then analyzed individually using packing 
lemma, it can be shown that the error probability at receiver 
Y\ can be made arbitrarily small if 

R lp + R[ < J(Vi;Yi|*7i) 

Ric + R 2 c + Ri P + R[ < I(Vv, Yi). 

At receiver Y 2 : Similarly it can be shown that the error 
probability at receiver 2 can be made arbitrarily small if 

R 2p + R 2 < I{V 2 ;Y 2 \Ui,U 2 ) 

R 2p + R 2 +T S < I(V 2 , U 2 ;Y 2 \U!) 

Ric + R 2c + R 2p + R' 2 < I{V 2 ; Y 2 \U 2 ) 

Ric + R 2c + R 2p + R' 2 +T 3 < I(V 2 , U 2 ;Y 2 ). 

At receiver Y3: Similarly it can be shown that the error 
probability at receiver 3 can be made arbitrarily small if 

S3 < I(X 2 ;Y 3 \U 2 ) 
T3 + S3 < I(X 2 ;Y 3 ). 

Collecting all inequalities, applying Fourier-Motzkin elim- 
ination with Ri = Ric + Rip, R 2 = R 2c + R 2p and 
i?3 = T3 + 5*3, and finally including a time sharing variable, 
we obtain an achievable rate region TZ. ■ 

Proposition 2: Inequalities ( fTTb and ( fT2l are redundant. 
Therefore TZ = TZ. 

Proof: To prove that (fTTl i and (fT2l are redundant, we 
follow the argument used in 1 12 1 to simply the Han-Kobayashi 
region. Fix time-sharing r.v. Q. Denote V = "Pui,Vi,V2,u 3 ,X2- 

We first prove that (fTTl i is redundant. For a given V, we 
show that if rate triple (Rx, R 2 , R3) satisfies all inequalities 
in TZ except ( fTTl i. then (Rx, R 2 , R 3 ) G TZ-p U2=<t> - Hence by 
time-sharing, (fTTT i is redundant. 

If ( fTTb is violated, we have 

R3 > I(V 2 ,U 2 ;Y 2 \Ui) + I(X 2 ;Y 3 \U 2 ) (13) 

If (Rx, R 2 ,Rs) satisfies all inequalities in TZ except (fTTl i. then 
it can be shown that (Rx, R 2 , R3) satisfies the following 

Rx <I(Vx;Yx) 

R 2 <I(V 2 ;Y 2 ) (14) 

Rz<I(X 2 ;Y 3 ) 
Rx + R 2 <I(Vx;Y x \Ux) + I(V 2 ;Y 2 )-I{Vi;V 2 \Ux) (15) 
Rx + R2<I(Vx;Yx) + I(V 2 ;Y 2 \Ux)-I(Vx;V 2 \Ux), (16) 



where ( TBl i is obtained from ([T) and JTJI i, JT5t from (|2) and 
( [T3l >. ( [Tol l from © and (fT3T >. Notice that the above region is 
exactly T-l-Pv Hence ( fTTT i is redundant. In the following we 
assume ( fTTT ) has already been removed. 

The case of ( fT2l ) is a little bit involved due to the constraint 
©. Let us first consider the following two statements: V 
satisfies |@); V satisfies (|4]i where is removed. If the latter 
statement is true, so is the former, but not vice versa. In the 
following, we will first focus on a class of V satisfying both. 

We prove that (TT~2T > is redundant using a similar argument 
for ([Til l. If ([T2l is violated, we have 

7? 3 > I(Vv,Yi\Ui,Q) + I(V 2 ,U 2 ;Y 2 \Ui) + I(X 2 ;Y 3 \U 2 )- 

I(V 1 ;V 2 \U 1 ). (17) 

If (Ri, i?2, -R3) satisfies all inequalities in 7Z except (fT2b . then 
i?2 7 -R3) G T^v U2=4> - Again to obtain (fT4] i. we use (Q]) and 
dTvT > and have 

R 2 < I(V 2 ;Y 2 ) - [J(Vi; Yi|l7i) + I(V 2 ; Y 2 \Ut) - J(Vi; V 3 |Z7i)] 
</(Va;y a ), 

where the last inequality is due to the fact that V satisfies 
where U 2 is removed. Similarly ([l"5t follows from (fj) and 
( fTTT i and ( TToT ) from (01 and ( fTTT i. Hence by time-sharing, (fTZt 
is redundant for this class of V . 

Next we focus on a class of V satisfying (0| but not 
without J7 2 , i.e. 

/(Vi; Yi \Ui) + I(V 2 ; Y 2 \U X ) - I(Vv, V 2 \Ui) < 0. (18) 

As we can see, an attempt to repeat what we have done 
previously fails in this case since Vu 2 =4> is not a valid joint 
input distribution. However, a careful examination of ([18} 
reveals the truth that this particular V is simply a bad choice 
for the binning coding because the penalty term I(Vi; V 2 \Ui) 
arising from having correlated inputs is so large that we 
might have done better provided no binning coding is used. 
Hence we consider Vui=Vl> where essentially we only make 
use of superposition and the binning aspect is not present 
resulting in I(Vi;V 2 \Ui) = 0. From ([T8~l i. especially two 
derived conditions I(Vi; Yi|Z7i) - I(V 1 ;V 2 \U 1 ) < and 
I{V 2 ;Y 2 pi) - I(V 1 ;V 2 \U 1 ) < 0, it can be checked that 
IZ-p C "TL-pu =v f° r tr, i s class of joint distributions. Then by a 
similar argument for ([PH . it can be shown that any rates that 
violate (FPU) automatically fall into Tt-p u =v u ^. Hence by 
time-sharing, ( fT2b is redundant. ■ 

Appendix B 
Proof of CorollaryE] 

Proof: 

Consider a region 1Z which is the same as 7Z in Lemma [6] 
App. [A] except that (IT2b is removed. Since dTTT > and (TTSl i are 
redundant by Proposition [2] App. [A] we have 7Z = 1Z = 1Z. 
Now fix Q, evaluate 1Z with X\ = V\, V2 = U\ to obtain a 
region specified by the same inequalities defining 1Z 2 plus one 
extra inequality, 

R3 < I(U 2 ; Y 2 1 Ux ) + I(X 2 ; Y 3 \ U 2 ) . 

Using the same argument in Proposition [2] App. [A] we can 
show that this inequality is redundant. ■ 



Appendix C 
Proof of Proposition[TJ 

Proof: 

We prove for 1Z\. The case of 1Z 2 follows similarly. 

Let Q take two values 1, 2 with probability a and a, < 
a < 1. Denote two sets of [U{,X{, U%, X\,Yi,Y 2 % ) where 
i = 1, 2. For Q = i, define U^ Q = U{, U 2 ,q = V\, X x = X\, 
X 2 = X\,Y\ = Y{ and Y 2 = Yj. Then we have Markov chain 

(q,u 1iQ ,u 2 ,q) -»• (x u x 2 ) -> (r 1; y 2 ). 

For the 1st inequality in 72-1, 

a/C^jY! 1 ) + a/((7i 2 ; Y 2 ) = /([/ llQ ; Y 2 |Q) < I{U llQ ,Q; Y 2 ). 

For the 2nd inequality in TZ\, 

aI{Xl;Yi) + aI{Xl;Yi) 

= I(X 2 ;Y 3 \U 2 , Q ,Q) + I(U 2 , Q ;Y 3 \Q) 

< I(X 2 ;Y 3 \U 2 , Q ,Q) + I(U 2 , Q ,Q;Y 3 ) = I(X 2 ;Y 3 ) 

Similarly, we can show that the convex combinations of 
the right-hand sides of the 3rd, 4th inequalities in IZi are 
respectively less or equal to 

I{U\,q,Q; Y) + I(X l ; Y 2 \U UQ ,U 2 , Q , Q) 

I(Ui,q, Q; Yl) + I(Xi, U 2 ,q, Q; Y 2 \U\,q, Q) + I(X 2 ; Y 2 \U 2 ,q, Q) 

Redefine U x = (U\,q, Q) and U 2 = (U 2 ,q,Q). We see 
that the time-sharing region is always contained within 1Z X 
for some V Vl >Xl , u 2 ,x 2 ■ ■ 

Appendix D 
Proof of Lemma[2] 

Proof: We will use the following notations. 51 denotes 
the set of all DExPs of 1Z 2 . For some predefined R[, denote 
n 2 (R[) = {(R 2 ,R 3 ) : (R[,R 2 iR 3 ) e K 2 } and the corre- 
sponding set of all DExPs f^i?^). Similarly, we could also 
define H 2 {R' i ) and n(R' t ) for i = 2,3, and K 2 {R' k ,R\) and 
Sl(R' k , R[) for k, I = 1, 2, 3 and k < I. 

TZ 2 is given by a system of linear inequalities. Since DExPs 
are ExPs by definition, which can be found by solving the 
system of linear equations given by some active constraints, 
one approach to find 51 is to consider all possible combinations 
of active constraints whose corresponding system of linear 
equations admits a unique solution and then compare the 
obtained ExPs one by one. There are totally eight inequalities 
in 1Z 2 making this approach tedious. Fortunately, we can make 
use the property of DExPs to simply the procedure and make 
it more systematic so that we don't overlook any DExP. 

Let R* denote the largest admissible Ri in 1Z 2 . Then DExPs 
can be sorted into four categories: 
Case 1: (Rl,R 2 ,R 3 ) e 51 for some R z < R*, i = 2, 3 
Case 2: (R 1 ,R^,R 3 ) e 51 for some R z < R*, i = 1, 3 
Case 3: (R 1 ,R 2 ,R^) G 51 for some R t < R*, i = l,2 
Case 4: (R U R 2 ,R 3 ) G 51 for some R % < R*, i = 1, 2, 3 

Note that Case 1, 2, 3 are not mutually exclusive. The point 
of a such division is, by considering Case 1, 2, 3, a higher 
dimensional (n = 3) problem can be reduced to a lower one 
(n = 1 or n = 2) and for the irreducible Case 4, the additional 
constraints Ri < R* will simplify the problem. This point will 
be made clear as we proceed in the following. 



Case 1: 

The largest admissible R{ = I{Xr, Yi\Ux) + I(U 1 ;Y 2 \U 2 ) 
is obtained by setting R 2 = in ©. Fixing R[ = R*, R^ = 0, 
the following two statements are equivalent 

(R[,R' 2 ,R 3 ) efi^R 3 e n(R[,R' 2 ). 

Since 1Z 2 (R[, R' 2 ) is one dimensional, we have Q,(R^,R 2 ) = 
{™p R3en3(W ^Rs} = {mm{I(X 2 ;Y 3 ),I(U 2 ;Y 2 ) + 
I(X 2 ; Y 3 \U 2 )}}, which gives us D. 
Case 2: 

The largest admissible R 2 — I(Ui;Y 2 \U 2 ) is given by ©. 
Fixing R' 2 = Rl, we have (R 1 ,R' 2 ,R 3 ) G il iff {Ri,R 3 ) G 
fi(i?0 and 

ft 2 (7? 2 ) = 

(Ri,R 3 ) :R 1 ,R 3 >0, R! </(Xi;Yi|{/i)=a 
i? 3 <min{/(X 2 ;Y 3 ), /(C/ 2 ; Y 2 ) + /(X 2 ; Y 3 \U 2 )} = bj 

is two dimensional. It is easy to see that £l(R' 2 ) — {(a, &)} 
yielding A. 
Case 3: 

The largest admissible R 3 is given by R^ = 

mm{l(X 2 ;Y 3 ), I(U U U 2 ; Y 2 ) + I(X 2 ;Y 3 \U 2 )} . If 
R* 3 = I(X 2 ; Y 3 ), fixing R' 3 = i?J and we have 

K 2 {R! 3 ) = 
( (Ri,R 2 ) : Ri,R 2 > 
R 2 <mm{/((7i;Y 2 |[/ 2 ), /(t/i, i7 2 ; y 2 ) - J(t/ 2 ; Y 3 )} = c 
Ri+R 2 < I(X 1 ;Yi\Ui) + mm{I(Ui;Y 2 \U 2 ), 
I(U 1 ,U 2 ;Y 2 ) - I(U 2 ;Y 3 )} 4 d 

Note that /(C/i, J7 2 ; Y 2 ) - /(^a; ^3) > in this case. It is easy 
to see Sl(i?g) = {(d— c, c), (d, 0)}, resulting in two DExPs: 

B' = (7(Zi;Yi|{7i), min{I(£/i;Y 2 |l7a), J(l/i, C/ 2 ; Y 2 )- 

/(t/ 2 ;Y 3 )}, I(X 2 ;Y 3 )) 
C = (/(Xi;Y 1 |(7i) + miri{J(t/ 1 ;Y 2 |(7 2 ), I{U^U 2 ;Y 2 )~ 

I(U 2 ;Y 3 )}, 0, J(X a ;Y 8 )) 

If i?5 = I{U U U 2 ;Y 2 ) + I{X 2 ;Y 3 \U 2 ), which is given by 
([Tol l by setting R 2 = 0, fixing = 0, R' 3 = R 3 and we 
obtain K 2 (R' 2 ,R 3 ) = {R% : < R x < I(Xv, Yi|Ui)} and 
n(R' 2 ,R 3 ) = {J(Xi;Yi|J7i)}. Hence we obtain one DExP 

£' = (/(X^Yilt/i), 0, J((7i,(7 2 ;Y 2 ) + /(X 2 ;Y3|l7 2 )). 

Combing the two cases, we rewrite B' and E 1 collectively as 
B and C and _E' collectively as C. 
Case 4: 

Under the condition R4 < R%, i = 1, 2, 3, 7?-2 is given by 



i?i </(X 1 ;Y 1 |C/ 1 ) + /(f/ i; F 2 |[/ 2 ) (19) 

R 2 <I{Vr,Y 2 \V 2 ) (20) 
7? 3 < min{/(X 2 ;Y 3 ), I(U X , U 2 ; Y 2 )+ 

I(X 2 ;Y 3 \U 2 )} (21) 

Ri+R 2 <I(X 1 ;Y 1 \U 1 ) + I(Uv,Y 2 \U 2 ) (22) 

i? 2 + 7?3 < I(U U U 2 ;Y 2 ) + I{X 2 ;Y 3 \U 2 ) (23) 
fli +i? 2 + 7? 3 < I(X 1 ;Y 1 \Ui) + I(Ui,U 2 ;Y2)+ 

I(X 2 ;Y 3 \U 2 ) (24) 

Ri,R 2 ,R 3 >0. (25) 



i? 2 

(a) (b) 

Fig. 2 



As mentioned before, DExPs are the solutions of systems of 
linear equations given by some active constraints. Hence we 
first consider all possible combinations of active constraints 
defining dominant faces, i.e. (f22l-(l24l and then add additional 
active constraints from (l25t as needed to ensure the resulting 
system has a unique solution. 

If 423, (|23]), (|24]l are all active, from (|23j, (EUl we get 
i?i = /(X^Y^Ui) and further with <|22), we get R 2 = 
I(Ui; Y 2 \U 2 ), which violates ([20}. If only (22), (|23]) are active, 
since the corresponding system of linear equations does not 
have a unique solution (more variables than equations), we 
choose one additional active constraint from ( |25V However, 
the obtained solution violates either (1 191 , d20l > or (fJTJ. We can 
proceed similarly for the remaining six possible combinations 
and none of them yields a valid DExP. Overall we conclude 
that there is no DExP in Case 4. ■ 

Appendix E 
Proof of Lemma[3] 

Proof: We use the same notations from the proof of 
Lemma [2 which are now defined over R-( 2 ) instead of 1Z 2 . 
As we can see, Ri is disassociated with R 2 , R 3 . Hence the 
DExPs of 7£(2) is of the form (R{, R 2 , R 3 ), where R{ = 
IiXtiYxpx). Fixing R[ = R* l7 we have 

r {R 2 ,R 3 ) : R 2l R 3 > 
IZ^iRi) = I R 2 < I{Ur,Y 2 \U 2 ) 4 a , R 3 < I(X 2 ;Y 3 ) 4 b 
{ R 2 + R 3 < I(U U U 2 ;Y 2 ) + I(X 2 ;Y 3 \U 2 ) 4 C J 

If c > b, i.e. I(U U U 2 ;Y 2 ) > I(U 2 ;Y 3 ), n {2) (R[) is de- 
picted in Fig. |2](a) and tt{R[) = {Ti,T 2 } = {fa, min{fe, c - 
a}), (min{a, c — b},b)}, yielding two DExPs, A and 

B' = (l{Xx\Yx\Ui), min{I(Ui;Y 2 \U 2 ), I(U!,U 2 ;Y 2 )- 
I(U 2 ;Y 3 )}, I(X 2 ;Y 3 )) 



If c < b, i.e. I(U U U 2 ;Y 2 ) < I(U 2 ;Y 3 ), K {2) (R{) is de- 
picted in Fig. |2](b) and n(R[) = {Ti, T 3 } = {(a, mm{b, c - 
a}), (0,c)}, yielding one more DExP of 7J( 2 ) 

C"= (/(Xi^t/i), 0, /((7i,f/ 2 ;Y 2 ) + J(X 2 ;Y3|;7 2 )). 

Note that B' and C can be rewritten collectively as B. ■ 



Appendix F 
Proof of Lemma|4] 

Proof: Since the argument is similar to that of Lemma 
12 we use the same notations, which are now defined over 
IZi instead of 1Z 2 . Again the DExPs can be sorted into four 
categories. We next discuss case by case. 
Case 1: 

The largest admissible R\ = I{Ui]Y x ). Fixing R[ = R$, 
we have (R' l7 R 2 ,R 3 ) e fi iff (R 2 ,R 3 ) £ tt(R[) and 

f (R 2 ,R 3 ) :R 2 ,R 3 >0 I 
fti(iii) = I R 2 < I(Xv,Y 2 \Ui,U 2 ), R 3 < I(X 2 ;Y 3 ) \ . 

{ R 2 + R 3 < I(Xi,Ur, Y 2 \Ui) + I(X 2 ; Y 3 \U 2 ) J 

Similar to the proof of Lemma [5] there are two DExPs: E, F. 
Case 2: 

The largest admissible R* = I(J7i; Yi) + I(X 1 ;Y 2 \U 1 ,U 2 ) 
is obtained by setting Ri = in ©. Fixing R\ = and 
R' 2 = R* 2 , we have Ki(R' v R' 2 ) = {R 3 : < R 3 < 
mm{I{X 2l Y 3 ), I(U 2 ;Y 3 \Ui) + I{X 2 ;Y 3 \U 2 )}}, resulting in 
point G. 

Case 3: 

The largest admissible R% = min{I(X 2 ;Y a ), 7([7i;Yi) + 
I(X u U 2 ;Y 2 \Ui) + I(X 2 ;Y 3 \U 2 )}. 

1. If I(U 2 ;Y 3 ) < I(Ui\Yx) + I(X 1 ,U 2: Y 2 \U 1 ), R* = 
I(X 2 ;Y a ). Fixing R 3 = R%, we have 

^1(^3) - 

' (R u R 2 ):R u R 2 >0, Ri<I(Ux;Yi)±a 

< fli+i«2</(l/i;li) + /(Jri;l2|l/i,l/2)+ 

min{0, I(U 2 ; Y 2 \Ui) - I{U 2 ; Y 3 )} 4 ^ 

Note that > in this case. This case is similar to that in 
Lemma [5] and we can show 

1) If I(U 2 ;Y 3 ) < IiXuU^lUx), n(R' 3 ) = 
{(0,6), (a, b -a)} 

2) If HXuUxYipi) < I{U 2 ;Y 3 ) < /(t/ijYl) + 
I(X X ,U 2 ; Y 2 \U X ), fi(J^) = {(0, 6), (6, 0)} 

Finally, we collectively write the obtained DExPs as 

H= (0, b, R*^, /= (miri{a,&}, [b - a] + , R*^. 

2. If I(U 2 ;Y 3 ) > IiU^Yx) + IiXx^Y^Ux), R* 3 = 
I{Ui;Y x ) + I(Xi,UxY 2 \Ux) + I(X 2 ;Y 3 \U 2 ), which is ob- 
tained by setting R\ = R 2 = in @. In this case we find 
one DExP J. 

Case 4: Similar to that for Lemma [2] it can be shown that 
there is no DExP in this case. ■ 

Appendix G 
Proof of Theorem[5] 

To prove the converse, we use the technique proposed in 
[14|. Specifically, we need the following lemma that can be 
easily proved using the same arguments for [14. Lemma 1]. 

Lemma 7: In a DM-BIC with Y x < Q Y 2 , if W ->■ 
{X?,X%) (Yf, Kf) form a Markov chain, then the 
following holds: 

ItY^-Y^W) > I(Yt l ;Y 2ti \W), \<i<n. 



Proof of Theorem \5} 

The achievable schemes are given by the coding schemes 
for 1Z\ in Corollary Q] and TZ(i) in Theorem [2] respectively, all 
with U 2 = X 2 . 

For the converse, we define Ui = {W x , Y^~ ). For some e„ 
such that lirrin^oo e„ = 0, by Fano's inequality, we have 

n(R x -e n ) <I(Wi;Yr) 

n n 

= J2 I (Wi;Y 1 , t \Yr 1 )<J2 I ( U ^)- 

1=1 1=1 

To bound R 2 , we proceed as the follows 

n{R 2 - e n ) < I{W 2 ;Y 2 n \W 1 ,XS) 

n 

^Y.IiX^-Y^WuX^^Yt 1 ) 

71 

= Y,HXi,i;Y 2 , i \Wi,x 2 ,i) -iiyt 1 ; Y 2 ,i\w ll x 2 ,i) 

(a) n 

< ^/(x 1 , i; y 2 , l |Wi,x 2 , l )-/(vr 1 ;^, l |H/i,x 2 , l ) 
i=i 

n 

= Y / nxi,i;Y2,i\Ui,x 2 , i ), 

1=1 

where (a) follows from Lemma [7] 

Now we consider upper-bound for R 2 + R 3 . The strong con- 
dition implies I(X r 2 l ;Yi l \X[ l ) > 7(X£;Y 3 n ), E3 Lemma]. 
Proceeding, 

n(i? 2 + R 3 - e n ) < I(W 2 ; Y 2 n ) + I(W 3 ; Y 3 n ) 

< 7(Xr; KTIVKi) + /(X 2 n ; F 2 "|X a ") 

n 

= ^ I{X hl ,X 2 y, Y^W^Yt 1 ) 

!=1 

n 

= ^/(Xi, i ,X2, < ;y2,i|Wi)-/(yr 1 ;y 2 ,i|Wi) 

(c) n 

< ^i(Xi, i ,x 2 ,i;y2,i|w r i) -/(yr 1 ;^,^!) 

n 

= 5^I(Jfx,i,Jf2,i;y 2 ,i|l7i), 
j=i 

where (6) is due to the strong condition and (c) is due to 
Lemma [7] 

Finally, we have n(R 3 - e n ) < Yh=i 1 ( X %V> Y 3 ,i). The 
proof is complete by redefining U = (Uq, Q), Xj : i = Xj for 
j = 1, 2, and Yj^ = Yj, for i = 1, 2, 3, where Q is a uniformly 
distributed r.v. on (1, n). ■ 
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